Scaling up the Accuracy of K -nearest-neighbour Classifiers: a Naive-bayes Hybrid
نویسندگان
چکیده
k-nearest-neighbour (KNN) has been widely used as an effective classification model. In this paper, we summarize three main shortcomings confronting KNN and then single out three categories of approaches for overcoming its three main shortcomings. After reviewing some algorithms in each category, we presented a hybrid algorithm called dynamic k-nearest-neighbour naive Bayes with attribute weighting (simply DKNAW) by combining three improved approaches. We conduct extensive empirical comparison for the related algorithms in four groups, using the whole 36 UCI data sets selected by Weka. In the first three groups, we compare some algorithms in each category accordingly. In the forth group, we compare our hybrid approach to each single approach. At last, we discuss some directions for our future work on KNN
منابع مشابه
Performance Evaluation of Multistage Classifier
Ensemble of classifiers is one of the most researched methods in pattern classification in recency. It’s a well-known fact that multiple phases for evaluation provides more accuracy. In this paper we proposed a multistage classifier approach where we are applying three supervised classifiers for the classification in pattern recognition. Three Classifiers are Multilayer Perceptron (MLP), K-Near...
متن کاملScaling Up the Accuracy of Naive-Bayes Classifiers: A Decision-Tree Hybrid
Naive-Bayes induction algorithms were previously shown to be surprisingly accurate on many classii-cation tasks even when the conditional independence assumption on which they are based is violated. However , most studies were done on small databases. We show that in some larger databases, the accuracy of Naive-Bayes does not scale up as well as decision trees. We then propose a new algorithm, ...
متن کاملGenerating Estimates of Classification Confidence for a Case-Based Spam Filter
Producing estimates of classification confidence is surprisingly difficult. One might expect that classifiers that can produce numeric classification scores (e.g. k-Nearest Neighbour or Naive Bayes) could readily produce confidence estimates based on thresholds. In fact, this proves not to be the case, probably because these are not probabilistic classifiers in the strict sense. The numeric sco...
متن کاملAttack Type Prediction Using Hybrid Classifier
Due to the rapid increase in terrorist activities throughout the world, there is serious intention required to deal with such activities. There must be a mechanism that can predict what kind of “attack types” can happen in future and important measures can be taken out accordingly. In this paper, a hybrid classifier is proposed which consists of some existing classifiers including K Nearest Nei...
متن کاملComparison of Classification Methods: Peril to Avoid for Binary and Multi Propose Combination Approach
ABSTRACT: Classification plays an important role in various fields like Object recognition, text categorization etc. Studying classifiers for purpose of estimating probability for a ce is crucial for classification .In this paper, we present a survey of four k Nearest Neighbour, Naive Bayes and Neural Network focusing on their merits and demerits.We will also shed light on combination of the ab...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009